AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
TensorRT Optimization

# TensorRT Optimization

Llama 3.1 8B Instruct FP8
FP8 quantized version of Meta Llama 3.1 8B Instruct model, featuring an optimized transformer architecture autoregressive language model with 128K context length support.
Large Language Model Transformers
L
nvidia
3,700
21
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase